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Abstract 


The use of synthetic aperture radar (SAR) images represents a valuable 
source of information to characterize geographic regions susceptible to 
flooding, such as southeastern Mexico, they are not sensitive to cloudy 
and / or dark conditions. This research presents a methodology to identify 
bodies of water in a region of southeastern Mexico. Three machine 
learning algorithms were implemented: Random forests (RF), Gradient 
Boosting (GB) and Support Vector Machines (SVM) to classify three target 
classes: Class A (water, flooded areas, and bodies of water); class I 
(urban infrastructure and / or bare soil), and class V (vegetation) from 
SAR images. The SAR image used covers a projected geographical area 
UTM Zona 15 Norte WGS84 located in the states of Tabasco and Chiapas; 
this was pre-processed to reduce errors in the image. The RF, GB and 
SVM models were implemented in Python language. These were trained 
and tested in prediction from a database of 12 000 samples with 
amplitude values of the SAR image. The RF model obtained an overall 
classification accuracy (PG) of 97.9 (+/- 0.003) %; GB obtained PG = 97.9 
(+/- 0.003) %, and SVM PG = 97.4 (+/- 0.005). The three models 
obtained an F1_s value higher than 0.99 to predict class A; RF obtained 
AUC = 1 for the three target classes. This study shows the potential use 
of SAR satellite images and the high performance of RF, GB and SVM 
machine learning models to classify and identify water bodies as well as 


highlighting its importance in studies of possible impacts of floods. 
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Resumen 


El uso de imágenes de radar de apertura sintética (SAR) representa una 
fuente valiosa de información para caracterizar regiones geográficas 
susceptibles de inundaciones, como en el sureste de México, ya que éstas 
no son sensibles a condiciones de nubosidad y/u oscuridad. En esta 
investigación se presenta una metodología para identificar cuerpos de 
agua en una región del sureste de México. Se aplicaron tres algoritmos 
de aprendizaje automático: bosque aleatorio (RF), potenciación del 
gradiente (GB) y máquina de soporte vectorial (SVM) para clasificar las 
tres clases objetivo A: agua, áreas inundadas y cuerpos de agua; 1: 
infraestructura urbana y/o suelo desnudo, y V: vegetación a partir de 
imágenes SAR. La imagen SAR utilizada cubre una zona geográfica 
proyectada UTM Zona 15 Norte WGS84, localizada en los estados de 
Tabasco y Chiapas, la cual fue preprocesada para disminuir errores en la 
imagen. Los modelos RF, GB y SVM se implementaron en lenguaje Python, 
que fueron entrenados y probados en predicción a partir de una base de 
datos de 12 000 muestras, con valores de amplitud de la imagen SAR. El 
modelo RF obtuvo una precisión global de clasificación (PG) de 0.979(+/- 
0.003); GB obtuvo PG = 0.979(+/-0.003), y SVM PG = 0.974(+/-0.005). 
Los tres modelos obtuvieron un valor de F1_score superior a 0.99 para 


predecir la clase A; el clasificador RF obtuvo valores de 4UC = 1 para las 
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tres clases objetivo evaluadas. Este estudio permite mostrar el uso 
potencial de las imágenes satelitales SAR y el alto desempeño de los 
modelos de aprendizaje automático RF, GB y SVM para clasificar e 
identificar los cuerpos de agua, así como resaltar su importancia en 


estudios de los posibles impactos de las inundaciones. 


Palabras clave: aprendizaje supervisado, modelos de predicción, 
imágenes satelitales, árboles de decisión, cuenca hidrológica, sensores 


remotos, curvas ROC. 
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Introduction 


Today, synthetic aperture radar images obtained by remote sensors are 
available. Unlike optical sensors, radar images do not depend on reflected 
solar radiation or on heat radiation emitted by the earth. Rather, they 


emit their own electromagnetic radiation for their probes. Therefore, a 
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radar image is not affected by meteorological conditions or darkness 
(Sami € Abdulmunem, 2020; Fernández-Ordoñez 8: Soria-Ruiz, 2015). 
Satellite images cover large areas and even areas of difficult access. From 
their analysis, it is possible to extract information of interest indirectly. 
Processing radar images is a challenge in terms of dealing with 
deformations and noise caused by inclination of the sensor. However, 
overall accuracy of the classifier is conditioned by the level of processing 
of the data to reduce mottling in simple and multitemporal images, and 
by the characteristics of the images used as predictors, such as texture, 
color, object to be classified, agricultural areas, bodies of water, urban 


areas, and the number of target classes (Gomarasca et al., 2019). 


There are different approaches to classifying an image. Election of 
a specific approach depends on the nature of the object to be detected 
and the quantity and quality of the data. Machine learning algorithms 
have good predictive capacity, according to Avendaño-Pérez, Parra-Plazas 
and Fredy-Bayona (2014), who reported the use of support vector 
machine (SVM) with Gaussian kernel and a Bayesian model to classify 
water, soil, and population. They point out that performance of the SVM 
model was superior to the Bayesian model. Pulella, Aragáo-Santos, Sica, 
Posovszky and Rizzoli (2020) mapped forested areas in the state of 
Rondonia, Brazil, with a random forest (RF) model trained with Sentinel- 
1 multitemporal radar images to classify artificial, forested, and non- 
forested areas; they obtained an overall classification accuracy of more 
than 80 %. 
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Radar images have been used in diverse fields of study. They have 
been used, for example, to identify oil spills, monitor sea ice, detect sea- 
going vehicles, classify ground cover, monitor soil moisture, and detect 
flooded areas. Shen, Wang, Mao, Anagnostou and Hong (2019) presented 
an ample review on the advantages and disadvantages of the use of these 
images to map flooded extensions. These authors reported acceptable 
accuracy in machine flood mapping with no obstructions. However, 
detection of floods under vegetation in urban areas was less satisfactory. 
Lin, Yun, Bhardwaj and Hill (2019) described a study for flood detection 
in urban areas from multitemporal Sentinel-1 satellite images after 
hurricane Matthew in 2016 in North Carolina, USA, and obtained low 
precision. In a similar manner, Zhang et al. (2018) reported the use of 
Sentinel-12 and Sentinel-1B multitemporal data for zoning floods induced 


by hurricane Irma in Florida, USA, in 2017. 


In southeastern Mexico periods of intense rains occur frequently in 
the months of August, September, October, and November, causing 
floods and economic losses in different sectors of the population and 
occasionally, losses of human life (Sánchez, Salcedo, Florido, 8 Mendoza, 
2015). ). For this reason, it is of interest to have reliable indirect methods 
of identifying areas affected by floods since in situ identification can be 
costly and slow. In these situations, satellite images and supervised 
machine learning models can be of great use in the classification of areas 


affected by floods. 
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The state of Tabasco must deal with floods because of geographic 
factors such as the presence of Mexico's two largest rivers (the 
Usumacinta and the Grijalva), mean annual precipitation of 2426 mm, 
lack of territorial planning, deforestation of the high parts of the 
watershed, climate change, and anthropic factors (Arreguín-Cortés € 


Rubio-Gutiérrez, 2014; Perevochtchikova 8 Lezama-de-la-Torre, 2010). 


The objective of this study was to apply three machine learning 
algorithms (gradient boosting, random forest, and support vector 
machine) for zoning a geographic region in the states of Tabasco and 
Chiapas into three target classes: class A (flooded areas and bodies of 
water, class I (urban infrastructure and/or bare soil, and class V 
(vegetation), from synthetic aperture radar images. An additional 
objective was to illustrate the potential use of radar images to detect 
bodies of water. The radar image we analyzed corresponds to a scenario 
caused by the flood that occurred in the study area on October 8, 2017, 
in the state of Tabasco, Mexico. 
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Materials and methods 


Study area 


The study area is delimited by the red polygon seen in Figure 1, with 
reference to the World Geodetic System 1984 (WGS84) of geographic 
coordinates, with a projection to the UTM (Universal Transverse Mercator) 
system of coordinates. The synthetic aperture radar (SAR) image covers 
a large part of the state of Tabasco and a small part of the state of 
Chiapas. The study area includes 408 687 ha of this image. 
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Figure 1. Study area, Tabasco and Chiapas, SRC UTM Zone 15 North 


WGS84. 
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Input data 


To conduct this study, we obtained SAR images from the Sentinel-1A 
satellite. The images were downloaded through the WEB site of the 
Institute of Geophysics of the University of Alaska, Fairbanks (ASF, 2020). 
Table 1 presents the specific characteristics of the SAR image. 


Table 1. Characteristics of the SAR image. Source: Copernicus Sentinel 
Data (2017). 


Concept Description 
Product S1A_IW_GRDH_1SDV_20171008T 
Date of acquisition 8-OCT-2017 12:00:16.264908 
Product level 1, Standard georeferenced product 
Acquisition method IW (Interferometric Wide) 
Azimuth band width 327 HZ 
Product type GRD (Grand Range Detection) 
Polarization Dual VW + VH 
Frequency Band C 
Step Descendent 


Digital processing of the images was performed with the free access 
software SNAP (sentinel application platform). The Sentinel 1 toolbox 
facilitates SAR image processing (ESA 8 SEOM, 2019). The SAR images 


were transformed to a matrix format with Matlab software (MathWorks 
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Inc., 2016), and geospatial processing of the SAR images was performed 
with QGIS software (QGIS.org, 2020). The machine learning models for 
classification were implemented in Python language with the Scikit-learn 
library (Pedregosa et al., 2011). Processing of images, data and models 
was carried out with a computational system in the 64-bit environment 


Windows 10, Intel Core ¡5 processer 02.50 GHz, and 8 GB RAM. 


SAR image preprocessing 


Prior to their analysis, the SAR images were preprocessed with SNAP. 


Figure 2 shows the preprocessing stages of the image. 
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Figure 2. Preprocessing stages of the synthetic aperture radar (SAR) 


image. 
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The correction process consisted of defining a subsection of the 
original image that depends on the area of interest and the available 
computational processing capacity. To this subsection, radiometric 
correction was applied to eliminate the most visible mottles of the image; 
a Lee filter with parameters look = 1, and window size 3x3 is used. A 
geometric correction was then applied to georeference the image and 
facilitate its geographic visualization. Finally, the images were exported 
to a TIF graphic format (Abdurahman-Bayanudin 8, Heru-Jatmiko, 2016; 
Podest, 2018; UN-SPIDER, 2020). 


Sample selection 


The database was created based on samples of the SAR image associated 
with three target classes: class A (flooded areas and bodies of water), 
class 1 (urban infrastructure and/or bare soil, and class V (vegetation). 
The samples of the image highlight the characteristics of interest, such as 


rivers, lagoons, swamps, cities, highways, vegetation, and bare ground. 


Class A comprises bodies of water: perennial rivers, intermittent 


rivers, lakes, lagoons, and flooded agricultural areas. The pixels of this 
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class were selected based on convenience sampling. The boundaries of 
the bodies of water were detected by two segmentation methods. The 
first, the regional growth method, consists of applying a dissimilarity 
parameter (d = 0.035) from a seed pixel to obtain a binary image with 
values equal to 1 for the pixels (x, y) similar to the seed and O otherwise. 
The second, the threshold method, consists of analyzing the histogram of 
the image to define the values of the upper and lower thresholds that 
segment the objects in the best possible way. This method generates a 
binary image with pixels (x, y) equal to 1 if they are found within the 


interval of the previously defined thresholds, and O otherwise. 


Class A was created with the width values of the bands sigma0_VH 
and sigma0_VV for each pixel (x, y). These pixels were then exported to 
a text file with the coordinates (x, y) of each pixel and its sigma0_VH and 
sigma0_VV values. 


Class I represents the physical infrastructure, which includes 
highways, buildings, houses, stores, factories, and soil with little or no 
vegetation that are visible in the Google satellite hybrid auxiliary image. 
To define this class, convenience sampling was used; to each sample of 
the image, binary segmentation was performed using a manually defined 
threshold. The sample of large cities that is ¡llustrated in format uínt8 was 
segmented with the threshold interval of 0.35 — 255. The bands sigma0_VH 
and sigma0_VV were exported for the pixels (x, y) with values equal to 
1, resulting in segmentation. 
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Class V represents vegetation: forests, grasslands, agricultural 
area, and jungle. This class is identified by means of a convenience 
sampling that consists of defining small windows that, in their totality, 
represent a type of vegetation. Then, by means of segmentation by 
regional growth (d = 0.5), the matrix of image data is accessed to extract 
the values of the bands sigma0_VH and sigma0_VV., 


Machine learning algorithms 


Three machine learning algorithms were selected for multiclass 


classification based on the characteristics of the SAR image bands. 


Random forest model 


The random forest model (RF) proposed by Breiman (2001) consists of a 


combination of predictors of decision trees in which each tree depends on 
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the values of a random independent vector with equal distribution for all 
the trees of the forest. Later, a large number of decision trees is 
generated, and the most frequently predicted class is selected. RF is 
defined as a cClassifier that comprises a set of structured tree 
classifiers (h(x, Ok), k = 1,...), where Ok are independent random vectors, 
identically distributed, and each tree emits a unitary vote for the most 
frequent class, given an x entry. Geurts, Ernst and Wehenkel (2006), in 
the construction of a decision tree, proposed a replica of a learning sample 
bootstrap and the CART (classification and regression trees) algorithm 
together with the modification used in the sub-spatial method. In each 
test node, the optimal division is obtained by searching for a k-size subset 


of entry variable candidates (selected without replacement). 


In the division of a node in a decision tree (in classification 
problems), the most used loss function is the index of Gini (i¿(t)) defined 
as (Gini, 1912): 


L= i5(0) = Ef PCC — p(Cglt)) (1) 


where p(C,|t) is the proportion of samples that belong to class k, and J 


represents the total number of classes for a particular node t. 
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Gradient boosting model 


The gradient boosting model (GB) is defined by a set of decision trees 
that are trained sequentially using an iterative computational procedure 
where each subsequent model corrects the errors of its predecessor, 


resulting in a more robust model (Friedman, 2001). The loss function to 
be optimized is expressed as follows: 


LY mes Fr (609115) = — ía yr 108 Py (3) (2) 


where y, €f0,1) k=1,...,K classes, and  P,(x)= Pr(y, = 1|x).  Logistic 


regression (sigmoid function) is often used to find the probability that 
input x generates y = 1. 


Support vector machine (SVM) 


A support vector machine (SVM) is considered an extension of the 


perceptron model (Raschka 8 Mirjalili, 2017). SVM maximizes the margin, 
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that is, the distance between the separation hyperplane (decision limit) 


and the training samples next to this hyperplane called support vectors 


(Figure 3). 
Margin 
N Support vectors 
X2 X2 W 
E 
Se Decision limit 
SS NN + + 
O AN . * + " 
O Negative X ati: 
—————  hyperplane 
o) po hyperplane EN 
O e) NN WX=-1 0 O WRASA 
A. SVM: ds 
Which hyperplane? Maximize margin 


Figure 3. Description of the search for the maximum separation 
hyperplane between classes in a support vector machine (SVM). Source: 
Raschka and Mirjalili (2017). 


Vapnik (1995) introduced the concept of looseness (¿,¿*) to soften 
the linear restrictions and converge in optimization of problems that are 
not linearly separable. The loss function consists of minimizing the L 
function (Equation (3)) subject to the restrictions (Equations (4), (5), and 
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(6)). The hyperparameter C controls the width of the margin of separation 
between classes (Smola 8 Schólkopf, 2004). 


L= min (5IIwil? + CXL(é +6") (3) 
1-WMX)-5S 245 (4) 
(wx)+b=y Se +f”, (5) 
Ep E*,20 (6) 


where (f(x,,y1), ..., (X, y )) is the set of training data contained in space X; 
(w, x¡) denotes the product point between the vector of weights w and the 


inputs x; in X; and b is a variable that represents the bias. 


The objective is to obtain a function that has a maximum deviation 
e Of the target classes y; obtained for all the training data and ¡is as flat as 
possible, flat means the search for the smallest w. To guarantee this 
condition, the minimum of the norm Jlw]l? = (w,w) is obtained; this is 


transformed into a problem of convex optimization (Equation (3)). 
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Classifier training 


To train the classifiers RF, GB, and SVM, a k-fold cross-validation (CV) 
was performed; this consists of subdividing the original data set into k 
different subsets. For k = 10, ten combinations of training and test data 
sets are generated in a proportion of 9:1 (MathWorks Inc., 2021). With 
these subsets, the predictive capacity of the classifiers is evaluated. This 
technique permits avoiding problems of overfitting the algorithms to the 
training data. Likewise, stratification is carried out to obtain the same 
proportions of classes in each selected training and test subset. When 
training is finalized, k performance values (PG) of the model and the 
average PG are obtained (Raschka, 2018). 


Selection of classifier hyperparameters 


Optimal selection of the hyperparameters (fitting parameters that do not 
depend on the input data) of each classifier was carried out in two stages. 


The first was selection on a validation graph that describes the variation 
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of the classification error in function of the values of each selected 
hyperparameter. In the second stage, selection is fine-tuned through a 
grid search that consists in defining a search range for each 
hyperparameter (Table 2). This allows selecting the combination of 
hyperparameter values that obtains the best performance of the 


algorithm. 


Table 2. Hyperparameter search ranges for the random forest (RP), 


gradient boosting (GB), and support vector machine (SVM) models. 


Model Hyperparameter Range 
Number of estimators (N) (1, 500) 
BE Maximum depth (MP) (1, 46) 


Minimum samples per node (MMD)  |(2, 10) 


Minimum samples per leaf (MMH) (1, 10) 


Learning rate (TA) (0.0001, 10) 
Es Number of estimators (N) (1, 800) 
Minimum samples per leaf (MMH) (1, 10) 
Maximum depth (MP) (1, 15) 
Kernel (K) rbf, poly, s, | 
SvM Parameter of penalization (C) (0.1, 100) 
Gamma (G) (0.1, 1) 


rbf: Gaussian, poly: polynomial, s: sigmoid, and /: linear. 
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The RF and GB ensemble models are based on the standard 
algorithm of decision trees, whose structure can be controlled by the 
definition of hyperparameters: number of trees or estimators (N), 
minimum number of samples to be considered a node (MMD), minimum 
number of samples to be considered a leaf (MMH), and maximum 
permitted depth limit (MP), that is, no additional division of this value is 
allowed (Swamynathan, 2017). The learning rate (TA) refers to the 
magnitude of the updates of the GB algorithm; a low value delays 
convergence time with a high probability of selecting a local minimum 
value, and a high value increases the possibility of skipping the global 
minimum solution. Selection of this hyperparameter is experimental and 
with previous knowledge of similar problems (Raschka € Mirjalili, 2017). 
In SVM, C is the hyperparameter that balances between minimizing the 
looseness variables and maximizing the margin. Moreover, SVM 
implements functions of mapping or kernels to augment the dimension of 
the problem and find a separation plane, which without this 
transformation would not be possible to visualize. The most frequent 
kernel is Gaussian (rbf, radial basis function) with ¡ts Gamma 


hyperparameter (Patle 8 Chouhan, 2013). 
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Metrics of performance assessment 


To assess the performance of the RF, GB and SVM classifiers, the following 
metrics were used: global precision of the correct classification (PG), 
Precision (P), Sensitivity (S), F1_score (F1s) and the AUC area under the 
ROC (receiver operating characteristic curve). These metrics are obtained 
from a confusion matrix that describes the results of the classifier in a 
double input table, which shows the predicted values (columns) and 
observed values (rows) of each class (Figure 4). The confusion matrix 
allows visualization of the classifier's hits and misses (Barrero-Ortiz, 
2019). 
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(FP) Negativos (VN) 


Figure 4. Description of a confusion matrix for the case of binary 


classification. 


PG = 1-—Ec measures the overall proportion of correct classifications, Ec 


is the classification error; PG is calculated by: 


PG = VPN (7) 


— VP+VN+FP+FN 


Where VP is the number of positive samples that are correctly 
classified; VN is the number of negative samples that are correctly 


classified; FP is the number of negative samples that are incorrectly 
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classified as positive; and FN is the number of positive samples that are 
incorrectly classified as negative; S is the proportion of cases that are 
correctly predicted as positive, relative to the total observed positive 


values and is defined as: 


VP 


En VP+FN (8) 


P is the proportion of VP relative to the total predicted positive values and 


is calculated as (Bradley, 1997): 


VP 
do VP+FP (9) 


Fis is the harmonic mean of S and P and is expressed as: 


Fls = (10) 


where Fls varies in the interval (0, 1). 


The ROC measures the discriminating capacity of the model to 
differentiate the classes and compares the discriminating capacity of two 
or more models. In a typical ROC graph, each point of the curve 


corresponds to a possible cutting point of the model and shows its 
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respective S values (Y axis) and TFP = FP/(FP + VN) (rate of false positives 
on the X axis). AUC measures how well the model discriminates samples 
that belong to a class A from others that do not, along the entire interval 
of possible cutting points (Parikh, Mathai, Parikh, Chandra-Sekhar, 8 
Thomas, 2008; Fawcett, 2006). 


Results and discussion 


Radar image processing 


Figure 5 shows the results of processing the radar image of the study 
area. Radiometric calibration for the VH bands (Figure 5A) and VV (Figure 
5C) standardizes the values of retro-dispersion. Lee filters were used to 
correct the salt and pepper noise as well as later It was applied a 
geometric correction to WGS84 UTM zone 15 north of the VH bands 
(Figure 5B) and VV (Figure 5D). 
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Figure 5. Radar image processing, VH polarization: A) radiometric 


calibration, B) mottle correction and geometric calibration; VV 
polarization: C) radiometric calibration and D) mottle correction and 


geometric calibration. 
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Selected and labeled samples 


As the result of processing and analyzing the SAR images, we obtained 
an integral database with 12 000 samples, 4000 in each target class (A, 


I, and V), purged of repeated and inconsistent samples 


Hyperparameters of the classifiers 


Graphic analysis 


The optimal hyperparameters of the RF, GB and SVM models selected in 
the first stage based on the graphic analysis of classification error curves 


with cross-validation (k = 10) are described in Table 3. 
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Table 3. Optimal values of the hyperparameters in the first stage of the 
random forest (RF), gradient boosting (GB) and support vector machine 


(SVM) models; average classification error (Ec) and standard deviation 


(Std). 
Model Hyperparameter | Optimum Ec Std 
N 200 0.024 0.004 
MP 9 0.022 0.003 
RF 
MMD 10 0.021 0.003 
MMH 1 0.021 0.003 
TA 0.08 0.022 0.003 
N 200 0.022 0.003 
GB 
MMH 1 0.022 0.003 
MP 1 0.022 0.004 
K rbf 0.027 0.004 
SvVM C 120 0.027 0.004 
G 1 0.026 0.005 


Figure 6 describes the behavior of the hyperparameters N, MP, MMD 
and MMH of the RF classifier. The individual effect of each hyperparameter 


on the average classification error (Ec). The analysis of the Ec graphs 
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allows visualization of the performance behavior and selection of the 


hyperparameter with the minimum Ec value (Figure 6). 


Figure 6. Behavior of the random forest (RF) model classification error 


in function of the hyperparameters A) N, B) MP, C) MMD, and D) MMH. 
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Grid Search 


To fine-tune the search for optimal values of the hyperparameters of each 
model, a grid search was carried out with cross-validation (k=10), and 


the best combinations with the smallest classification error were selected. 


The RF model with values N = 200, MP = 9, MMD = 10 and MMH = 1 
obtained a PG = 0.979 (+/-— 0.003). The GB model with values TA = 0.08, N = 
200, MMH = 1 and MP = 1 obtained a PG = 0.979 (+/-— 0.003), while the SVM 
model with values K=rbf, C=120 and G=1 obtained a PG= 
0.974 (+/— 0.005). 


Based on the grid search, the optimal values of the 
hyperparameters were different from the values selected with the Ec 
graph analysis and improved PG performance of the three classifiers. 
These combinations of optimal hyperparameters were used in the 
evaluation, comparison and final prediction of the RF, GB and SVM 


classifiers. 
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Performance assessment 


The assessment metrics of classifier performance obtained by cross- 
validation are presented in Table 4. The performance evaluation was 
carried out with a test set of 1200 samples (400 per class). The three 
classifiers performed better in identification of bodies of water (class A); 
RF and GB outperformed SVM slightly with F1s of 99.3 %. In contrast, 
AUC = 1 for the three classifiers means that they are excellent classifiers 
with adequate selection of the balance threshold between S and P. The 
high performance achieved for class A ¡is due to ¡its separability from the 
rest, with values of retro dispersion defined in the interval (-24.0, -22.0) 
(Fernández-Ordoñez et al., 2020). 
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Table 4. Performance metric assessment by target class, of classifiers 
random forest (RF), gradient boosting (GB), and support vector machine 
(SVM) based on the test set (10 % of the data), with k = 10 cross- 


validation). 
Model Ss P Fis AUC 
Class A 
RF 0.998 0.988 0.993 1.00 
GB 0.995 0.990 0.993 1.00 
SVM 0.995 0.988 0.991 1.00 
Class I 
RF 0.963 0.980 0.971 1.00 
GB 0.965 0.977 0.971 1.00 
SVM 0.950 0.987 0.968 1.00 
Class V 
RF 0.980 0.973 0.976 1.00 
GB 0.980 0.973 0.976 0.99 
SVM 0.990 0.961 0.975 1.000 


In identification of physical infrastructure (class 1), RF and GB 
performed better than SVM, with Fis = 97.1 % versus 96.8 %. Regarding 


identification of vegetation (class V), RF and GB performed better than 
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SVM, with Fis = 97.6 % versus 97.5 %. In terms of AUC, the three 
classifiers performed equally. The performance of the analyzed algorithms 
decreases due to the confusion between class 1 and class V, caused by 
the similarity of the retro dispersion values, translating into false negative 


predictions for both classes (Figure 7). 


—— AUC = 1.00 Clase A 
—— AUC = 1.00 Clase I 
—— AUC = 1.00 Clase V 


0.8 1.0 


A I V 
Predicción 


Figure 7. Confusion matrix and ROC of prediction performance of the 
random forest (RF) classifier based on the test set (10 % of the data), 


with k = 10 cross-validation. 


To obtain the best estimatorPG, a k = 10 cross-validation was 
carried out. Figure 8 describes the PG behavior of the classifiers RF, GB 
and SVM. Average PGis represented by a black dot inside each box. RF 
achieved PG = 0.979 (+/-0.003), GB obtained PG = 0.979 (+/-0.003), 
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and SVM attained PG = 0.974(+/-0.005). These results show that average 
performance of the three classifiers is very similar to that obtained with a 
single test set. Likewise, the ensemble classifiers RF and GB performed 
slightly better than SVM. For practical purposes, the three classifiers 


achieved high overall classification accuracy in prediction. 


' 
' 
' 
' 
l 
1 Y Y 


GB RF SVM 


Figure 8. Box plot comparing overall accuracy (PG) obtained by cross- 
validation (k = 10) of the classifiers gradient boosting (GB), random 


forest (RF), and support vector machine (SVM). 
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Study area zoning 


The input samples were classified with the trained RF, GB and SVM 
classifiers and their optimal hyperparameters. The input and output TIF 
images are georeferenced with the World Geodetic System (WGS84, 
World Geodetic System 1984) projected in UTM zone 15 north. Figure 9 
illustrates the classification of a subsection of the study area that covers 
46 842.8 ha (11.5 % of the total area). For each target class A, 1, and V, 
RF classified 3.6, 8.3 and 88.1 %; GB classified 3.7, 8.0, and 88.3 %; and 
SVM classified 3.7, 15.6, and 80.7 %, respectively. For class A, the 
percentages of area calculated by the three models were similar. For class 
I, however, SVM duplicated the estimated area, relative to that estimated 
by RF and GB. Figure 10 presents the classification of the study area in 
its entirety (408 687.1 ha). The RF model obtained A = 15 139.2 ha (3.7 
%), 1 = 30 318.8 ha (7.4 %) ha, and V = 363 229.1 ha (88.9 %). 
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Figure 9. Classification of bodies of water (A), infrastructure and bare 
soil (1), and vegetation (V) with random forest (RF), gradient boosting 


(GB), and the support vector machine (SVM) models in a section of the 
study area located in the states of Tabasco and Chiapas, Mexico. 
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Figure 10. Classification of bodies of water (A), infrastructure and bare 
soil (1), and vegetation (V) with the random forest (RF) model in the 


study area between the states of Tabasco and Chiapas, Mexico. 


Figure 9 confirms the high performance of the models (RF, GB and 
SVM) in identifying bodies of water with free area and little aquatic 
vegetation. The limits of the perennial rivers Grijalva and Usumacinta are 
perfectly delineated, as are the lagoons with their respective increment in 
extension. The machine learning models allowed identification of the 


urban areas of Villahermosa flooded by the intense rainfall on October 6, 
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7 and 8, 2017, that caused the rivers to overflow. The performance of the 
classifiers used in this study is slightly better than that obtained by Chen, 
Huang, Chen and Feng (2021), who used a standard adaptative threshold 
method to automate identification of flooded areas, obtaining a PG of 95- 
97 %. In a similar way, the work of Hlavácová, KacmarFík, Lazecky, 
Struhár and Rapant (2021), reported PG = 83 % mainly due to the study 
area and the automated approach that these authors used. The three 
classifiers correctly identified class 1, which corresponded to the urban 
area of Villahermosa. Nevertheless, they predict a high rate of false 
positives that correspond to class V. This error is duplicated with the SVM 
model (Figure 9). The number of identification errors increases when the 


prediction is realized in the entire study area (Figure 10). 


Conclusions 


The machine learning classifiers random forest (RF) and gradient boosting 
(GB) had better prediction performance than the classifier support vector 
machine (SVM) in identifying bodies of water from synthetic aperture 


radar (SAR) images. RF and GB had an overall average classification 
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accuracy (PG) of 97.9 % and SVM PG = 97.4 %. The three models 
obtained an F1_score above 99.3 % in the prediction of target class A 
(water); 97.6 % for class V (vegetation), and 97.1 % for class I 


(infrastructure). 


The high prediction performance of the three classifiers is due, 
among other reasons, to the fact that the target classes (water, 
vegetation, and soil) were balanced, that the samples of the SAR image 
associated with each class were separable with the values of the radar 
bands, and that the grid search of the hyperparameters of each model, 


based on cross-validation reduced classification errors. 


The prediction of areas covered by water with a global accuracy of 
99.2 % by the classifiers RF and GB from SAR images shows the potential 
use of these images in conducting studies related to detection of bodies 
of water, monitoring, and evaluation of flood damage. Overall, models 
were highly precise, although they were slightly less precise in 


distinguishing the infrastructure and vegetation classes. 
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